Sir:

I, J. Hans van de Sande, Ph.D., a citizen of Canada, hereby declare and
state:

1. The curriculum vitae attached as Exhibit A accurately reflects my 
professional credentials. As noted in Exhibit A, I am presently Vice-Dean of 
the Faculty of Medicine as well as a Professor in the Department of 
Biochemistry & Molecular Biology (formerly known as the Department of Medical 
Biochemistry) at the University of Calgary in Calgary, Alberta, Canada. 

2. I am a paid consultant of Ingeneus Corp. through my association 
with Genetic Diagnostics, Inc., a licensee of technology owned by Ingeneus 
Corp. I expect to be compensated for my time expended preparing this
document.

3. Prior to executing this Declaration, I reviewed the 
above-identified application, the May 30, 2002 Final Rejection, the January 
18, 2002 DECLARATION UNDER 37 C.F.R. § 1.132 of Jasmine I. Daksis, and the 
January 21, 2002 DECLARATION UNDER 37 C.F.R. § 1.132 of Richard A. Collins. 

4. The purpose of this Declaration is to address the assertions in
the Final Rejection that: (a) the application does not enable one skilled in
the art to make and/or use the invention; and (b) the claimed multiplex
structure lacks patentable utility.

Enablement 

5. Counsel for Ingeneus Corp. has advised me that an invention is 
patentably enabled if one of ordinary skill in the art could make or use the 
invention from the disclosures in the patent application coupled with 
information known in the art without undue experimentation. While I am not 
an expert in patent law, my experience and educational background, 
particularly as a professor of molecular biology and biochemistry, enable me 
to render an informed opinion as to the facts underlying the determination of 
enablement, including the level of ordinary skill in the art, information 
known in the art at the time of the invention, and what constitutes undue 
experimentation to one of ordinary skill in the art. 

6. Claim 1 specifies a multiplex structure comprising four strands, 
wherein the first strand is associated with the second strand by Watson-Crick 
bonding, and the fourth strand is associated with the second strand and the 
third strand by Watson-Crick bonding. In addition, at least one nucleobase 
of the fourth strand is associated by Watson-Crick bonding to at least one 
nucleobase of the third strand and to at least one nucleobase of the second 
strand. The meaning of Watson-Crick bonding in the context of the invention 
is provided in the application at page 5, lines 25-33: 

As used herein, the term "Watson-Crick bonding" is intended to 
define specific association between opposing pairs of nucleic 
acid (and/or nucleic acid analogue) strands via matched, opposing 
bases. While the formation of a Watson-Crick quadruplex may 
sometimes be referred to as a hybridization event herein, that is 
merely for convenience and is not intended to limit the scope of 
the invention with respect to how the formation of a Watson-Crick 
quadruplex can be best characterized.

One of ordinary skill in the art would have understood from the foregoing
that base claim 1 and the claims dependent therefrom are directed to a 
quadruplex of four nucleobase-containing strands, wherein Adenines align with 
Thymines (or Uracils) and Cytosines align with Guanines. 

7. The working examples of the specification show that Applicants 
were able to make and use the invention without undertaking detailed 
biophysical studies. Applicants have shown through binding studies specific 
association of non-denatured dsDNA targets with non-denatured dsDNA probes. 
The concentration of KCl used in, e.g., Example 1 of the application was
sufficiently high that it was highly unlikely that strand displacement 
occurred or that the dsDNA probes or dsDNA targets were denatured in any way. 
It is known that high concentrations of salt (e.g., 100 mM NaCl) inhibit 
strand invasion by virtue of the salt increasing the stability of dsDNA. 
See, e.g., Tomac et al., "Ionic Effects on the Stability and Conformation of
Peptide Nucleic Acid Complexes," J. Am. Chem. Soc. 118, 5544-5552 (1996)
(previously made of record in the application on January 18, 2002).
Accordingly, one of ordinary skill in the art would have expected the 100 mM 
KCl concentration of Example 1 to prevent strand displacement and 
denaturation. 

8. In light of Applicants' evidence that two strands on opposing 
non-denatured duplexes specifically interact together A:T(U) and C:G, one of 
ordinary skill in the art would have found it reasonable to infer that 
adjacent bases in the remaining two strands of the duplexes would be brought 
into close enough proximity by the initial pairing of opposing strands to 
specifically interact together A:T(U) and C:G. This inference would not have
been considered by an ordinarily skilled artisan to be unreasonable,
particularly in view of the teachings of, e.g., Zhang et al., "Dimeric DNA
Quadruplex Containing Major Groove Aligned A•T•A•T and G•C•G•C Tetrads
Stabilized by Inter-subunit Watson-Crick A•T and G•C Pairs," 312 J. Mol.
Biol. 1073-88 (Oct. 5, 2001), attached as Exhibit B.

9. Zhang et al. disproves the theory in the Final Rejection at
page 6, last sentence, that "Watson-Crick hydrogen bonding surfaces are
inaccessible for any other strands [i.e., strands other than the two
hybridized strands of conventional duplexes] since two strands are already
interacting with each other at the center of the double helix." Zhang et
al. shows through NMR studies the formation of A-T-A-T tetrads similar to
previously discovered G-C-G-C tetrads. Zhang et al. at pages 1073-74 states:

[E]fforts have been made to identify and characterize G•C•G•C
tetrads, where a pair of Watson-Crick G•C pairs can potentially
align either through their major groove or their minor groove
edges. . . . recent studies have demonstrated that G•C•G•C
tetrads aligned through their major groove edges can switch
between two distinct alignment geometries [shown in Figure 1(a)
and 1(b)]. . . . The major groove-aligned G•C•G•C tetrad has now
been observed in a range of DNA quadruplexes and appears to be a
robust tetrad motif adopted by a wide range of DNA sequences.

Figure 1 of Zhang et al. shows how major groove-aligned G•C•G•C and A•T•A•T
tetrads in their direct alignment geometry have each G hydrogen bonded to
each C, and each A hydrogen bonded to each T. Thus, contrary to the Final
Rejection, Zhang et al. and the art cited therein show that quadruplex
G-C-G-C and A-T-A-T binding is reasonably credible.

10. Thus, one of ordinary skill in the art (who has a high level of
skill and a high tolerance for complex experimentation) would have been able
to make and use with no more than routine experimentation the claimed
multiplexes for specific and useful purposes such as assays without ever
knowing for certain the location, length and/or number of hydrogen bonds
between adjacent bases in the multiplex.

Utility 

11. As shown by Zhang et al., the art recognizes the existence of 
quadruplex G-C-G-C and A-T-A-T binding under certain conditions, contrary to 
the assertion in the Final Rejection. Applicants have shown through binding 
studies specific association of non-denatured dsDNA targets with 
non-denatured dsDNA probes, wherein the targets and probes align Adenine to 
Thymine (or Uracil) and Cytosine to Guanine. Accordingly, one of ordinary 
skill in the art would have found the claimed invention to be reasonably 
credible in view of the original disclosure of the invention and conventional 
wisdom in the art. 

12. Similarly, the binding studies described in the previously filed 
Daksis and Collins Declarations further show the reasonable credibility of 
the claimed quadruplexes through evidence establishing the existence of 
related Watson-Crick triplexes. Further evidence exists in the form of the 
inventors' previously issued U.S. Patent Nos. 6,420,115, 6,403,313 and
6,265,170, which include many examples of Watson-Crick triplex binding, 
wherein Adenines align with Thymines (or Uracils) and Cytosines align with 
Guanines. Such evidence of triple-stranded Watson-Crick binding undercuts 
the theory that only duplex Watson-Crick binding exists, and adds to the
evidence that Watson-Crick quadruplexes are a reality. 

13. In addition, I have personally observed triplex hybridization 
experiments similar to those described in the working examples of the 
application, wherein single-stranded probes were able to discriminate between 
perfectly matched duplex targets and mismatched duplex targets under
non-denaturing conditions. The experiments further convinced me of the high
degree of recognition between the single-stranded probe and the duplex target
with complete base pairing between binding partners.

I hereby declare that all statements made herein of my own knowledge 
are true, and that all statements made on information and belief are believed 
to be true; and further that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine 
and/or imprisonment under Section 1001 of Title 18 of the United States Code, 
and that such willful false statements may jeopardize the validity of the 
application or any patent issuing therefrom.

Dimeric DNA Quadruplex Containing Major Groove-aligned A•T•A•T and G•C•G•C
Tetrads Stabilized by Inter-subunit Watson-Crick A•T and G•C Pairs

We report on an NMR study of unlabeled and uniformly 13C,15N-labeled
d(GAGCAGGT) sequence in 1 M NaCl solution, conditions under which it forms
a head-to-head dimeric quadruplex containing sequentially stacked G•C•G•C,
G•G•G•G and A•T•A•T tetrads. We have identified, for the first time, a
slipped A•T•A•T tetrad alignment, involving recognition of Watson-Crick A•T
pairs along the major groove edges of opposing adenine residues. Strikingly,
both Watson-Crick G•C and A•T pairings within the direct G•C•G•C and slipped
A•T•A•T tetrads, respectively, occur between rather than within hairpin
subunits of the dimeric d(GAGCAGGT) quadruplex. The hairpin turns in the
head-to-head dimeric quadruplex involve single adenine residues and add to
our knowledge of chain reversal involving edgewise loops in DNA
quadruplexes. Our structural studies, together with those from other
laboratories, definitively establish that DNA quadruplex formation is not
restricted to Gn repeat sequences, with their characteristic stacked uniform
G•G•G•G tetrad architectures. Rather, the quadruplex fold is a more
versatile and robust architecture, accessible to a range of mixed sequences,
with the potential to facilitate G•C•G•C and A•T•A•T tetrad through major
and minor groove alignment, in addition to G•G•G•G tetrad formation. The
definitive experimental identification of such major groove-aligned mixed
A•T•A•T and G•C•G•C tetrads within a quadruplex scaffold has important
implications for the potential alignment of duplex segments during
homologous recombination.

© 2001 Academic Press

Keywords: A•T•A•T and G•C•G•C tetrads; dimeric DNA quadruplex;
hydrogen bond alignments; inter-subunit Watson-Crick pairs



Introduction 

There is an increasing appreciation of the role DNA quadruplexes
(reviews 1-4) may play in biological processes ranging from replication,
transcription and recombination (review 5) to telomere function (review 4).
The earliest research focused on quadruplex formation involving stacked
G•G•G•G tetrads (review 6), with polymorphism introduced by the
directionality of adjacent strands around the quadruplex (review 4).

Formation of mixed tetrads could dramatically increase the versatility of
quadruplex formation by allowing adoption of this four-stranded architecture
by sequences other than simple Gn repeats. Towards this end, efforts have
been made to identify and characterize G•C•G•C tetrads, where a pair of
Watson-Crick G•C pairs can potentially align either through their major
groove or their minor groove edges. This approach was successful in the case
of the Fragile X syndrome triplet repeat-containing d(GCGGT3GCGG) sequence,
which dimerizes in solution through head-to-tail alignment of hairpins. The
resulting quadruplex contains G•C•G•C tetrads, through major groove
alignment of Watson-Crick G•C pairs (Figure 1(a)).7

Figure 1. Potential G•C•G•C tetrad formation involving either (a) direct or
(b) slipped alignment of major groove edges of a pair of Watson-Crick G•C
pairs. The slipped alignment required a monovalent cation to coordinate the
inwardly pointing acceptor atoms. Potential A•T•A•T tetrad formation
involving either (c) direct or (d) slipped alignment of major groove edges
of a pair of Watson-Crick A•T pairs.



Independent structural studies have identified minor groove-aligned G•C•G•C
tetrads (Supplementary Material, Figure S1(a)) in the crystalline8 and
solution9 states.

More recent studies have demonstrated that G•C•G•C tetrads aligned through
their major groove edges can switch between two distinct alignment
geometries. In the direct alignment (Figure 1(a)), the two G•C pairs align
directly opposite each other, resulting in the acceptor atoms (O6 and N7) of
G pairing with the donor group (NH2-4) of C. Alternately, in the slipped
alignment (Figure 1(b)), the major groove edges of two guanine bases are
positioned opposite each other. In this case, a monovalent cation (not
necessarily in the plane of the tetrad) is needed to coordinate the four
inwardly pointing nitrogen and oxygen acceptor atoms on the guanine bases
(Figure 1(b)). Solution structural studies of the adeno-associated viral
repeat-containing d(GGGCT4GCGC) sequence have demonstrated that the direct
G•C•G•C tetrad alignment (Figure 1(a)) is formed in Na+ cation solution,10
while the slipped G•C•G•C tetrad alignment (Figure 1(b)) is formed in K+
cation solution.11 Such conformational transitions were unanticipated, and
attest to the diversity of pairing geometries in mixed tetrad alignments and
the role of cations in modulating this transition.

The major groove-aligned G•C•G•C tetrad has now been observed in a range of
DNA quadruplexes7,10,12 and appears to be a robust tetrad motif adopted by a
range of DNA sequences. One can therefore anticipate potential formation of
the corresponding major groove-aligned A•T•A•T counterpart, either of the
direct (Figure 1(c)) or slipped (Figure 1(d)) alignment type. By contrast,
the corresponding minor groove-aligned A•T•A•T tetrad, of the slipped
cation-coordination type (Supplementary Material, Figure S1(b)), has been
reported earlier both in the crystalline13 and solution9 states.

Demonstration of the major groove-aligned A•T•A•T tetrad formation has
turned out to be a considerable challenge, and after many unsuccessful
attempts, our group has now identified a sequence, d(GAGCAGGT), which forms
a head-to-head dimeric quadruplex (Figure 2(a) and Supplementary Material,
Figure S2), stabilized by direct G•C•G•C (Figure 2(b)) and slipped A•T•A•T
(Figure 2(d)) tetrads, flanking a central G•G•G•G tetrad (Figure 2(c)).
Strikingly, we observe Watson-Crick G•C pairing between monomer subunits
within the G•C•G•C tetrad and Watson-Crick A•T pairing between monomer
subunits within the A•T•A•T tetrad in this quadruplex. Finally, the edgewise
turns in the head-to-head dimeric quadruplex involve a single adenine
residue and provide new insights into chain reversal in DNA quadruplexes.

Results 

Our group scanned a range of sequences containing G, A and T residues in our
attempts to generate a major groove-aligned A•T•A•T tetrad, stabilized
through stacking on a G•G•G•G tetrad within a quadruplex scaffold. This
approach has worked in the past in our laboratory when we have generated
stable triads and A•(G•G•G•G)•A hexads,12-1* stabilized through stacking on
G•G•G•G tetrads within a quadruplex scaffold. We report on NMR studies of
the d(GAGCAGGT) sequence, which forms a dimeric quadruplex containing a
major groove-aligned slipped A•T•A•T (Figure 2(d)) tetrad.

Figure 2. (a) Folding topology of a head-to-head dimeric d(GAGCAGGT)
quadruplex in 1 M NaCl. The backbone tracings of the individual strands are
shown by thick lines and the chain directionality indicated by arrows.
Pairing alignments for (b) the G6•C4•G6•C4 tetrad, (c) the G3•G7•G3•G7
tetrad and (d) the A2•T8•A2•T8 tetrad.



Imino proton spectra and assignments

The exchangeable proton spectrum (5.3 to 14.5 ppm) of the d(GAGCAGGT)
sequence was highly dependent on added NaCl concentration. At low NaCl
concentration, the predominant conformation gave very broad resonances in
slow exchange with a minor component that gave narrow resonances
(Supplementary Material, Figure S3). The equilibrium shifted with increasing
NaCl to the conformation that exhibited narrow resonances, resulting in the
spectrum shown in Figure 3(a) in 1 M NaCl, 2 mM phosphate buffer (pH 6.6) at
10 °C. This spectrum exhibits well resolved exchangeable (8.0 to 15.0 ppm)
and non-exchangeable (7.0 to 8.7 ppm) resonances, with the total number of
imino, amino and base proton resonances consistent with formation of a
single conformer in solution. We observe two narrow imino protons between 11
and 12 ppm, a region characteristic of N-H···O hydrogen bonds, and narrow
resonances at 13.11 ppm and 14.28 ppm, a region characteristic of N-H···N
hydrogen bonds4 (Figure 3(a)). We also observe two narrow amino proton
resonances between 8.0 and 8.5 ppm and two additional narrow amino proton
resonances downfield-shifted between 9.0 and 9.2 ppm (Figure 3(a)).

The exchangeable imino and amino protons have been assigned following
analysis of two-dimensional data sets on unlabeled and uniformly
13C,15N-labeled d(GAGCAGGT) sequence. An expanded 60 ms mixing time NOESY
contour plot of d(GAGCAGGT) in 1 M NaCl at 0 °C is plotted in Figure 3(b).
The NOE cross-peaks are labeled in the Figure and the assignments are given
in the legend. The corresponding NOESY data set recorded at a longer mixing
time of 200 ms is plotted in Supplementary Material, Figure S4, and the
cross-peak assignments listed in the legend. We could readily distinguish
thymine from guanine imino protons because of their distinct nitrogen
chemical shifts in a 1H-15N HSQC spectrum (Supplementary Material,
Figure S5).

We establish formation of an A2•T8 Watson-Crick base-pair based on NOEs
between the imino proton of T8 and the amino (peaks a and a', Figure 3(b))
and H2 (peak b, Figure 3(b)) protons of A2. In addition, we unexpectedly
observe NOEs between the H8 and NH2 protons of A2 (peaks n and n',
Figure 3(b)), consistent with formation of a major groove-aligned slipped
A2•T8•A2•T8 tetrad, schematically outlined in Figure 2(d).

We establish formation of a G6•C4 Watson-Crick base-pair based on NOEs
between the imino proton of G6 and the amino protons of C4 (peaks c and c',
Figure 3(b)). We also detect NOEs between the amino protons of C4 and the H8
proton of G6 (peaks l and l', Figure 3(b)), and between the H5 proton of C4
and the H8 proton of G6 (peak p, Figure 3(b)), consistent with formation of
a major groove-aligned direct G6•C4•G6•C4 tetrad, schematically outlined in
Figure 2(b).

The narrow G3 and G7 imino protons exhibit a set of NOEs indicative of
G3•G7•G3•G7 tetrad formation, schematically outlined in Figure 2(c). These
include NOEs between the H8 proton of G7 with the imino (peak e,
Figure 3(b)) and amino (peaks j and j', Figure 3(b)) protons of G3, and an
NOE between the H8 proton of G3 and the imino proton of G7 (box g,
Figure 3(b); observable in 200 ms NOESY spectrum, peak g, Supplementary
Material, Figure S4).

Figure 3. (a) Proton NMR spectrum (7.0 to 13.0 ppm) of the d(GAGCAGGT)
quadruplex (8.8 mM in strands) in 1 M NaCl, 2 mM phosphate in H2O (pH 6.6)
at 10 °C. (b) Expanded NOESY (60 ms mixing time) contour plot correlating
NOEs between imino, amino and non-exchangeable protons in the d(GAGCAGGT)
quadruplex in 1 M NaCl, 2 mM phosphate, H2O (pH 6.6) at 0 °C. The NOE
cross-peaks a to p are assigned as follows: a and a', T8(NH3)-A2(NH2-6);
b, T8(NH3)-A2(H2); c and c', G6(NH1)-C4(NH2-4); d and d',
G3(NH1)-G3(NH2-2); e, G3(NH1)-G7(H8); f and f', G7(NH1)-G7(NH2-2);
g (observable in 200 ms mixing time NOESY experiment), G7(NH1)-G3(H8);
h, G7(NH2-2)-G7(NH2-2); i, G3(NH2-2)-G3(NH2-2); j and j',
G3(NH2-2)-G7(H8); k, C4(NH2-4)-C4(NH2-4); l and l', C4(NH2-4)-G6(H8);
m and m', C4(NH2-4)-C4(H5); n and n', A2(NH2-6)-A2(H8);
o, A2(NH2-6)-A2(NH2-6); p, G6(H8)-C4(H5).

The NOE data outlined above for the d(GAGCAGGT) sequence are consistent with
formation of slipped A2•T8•A2•T8 (Figure 2(d)), direct G6•C4•G6•C4
(Figure 2(b)) and G3•G7•G3•G7 (Figure 2(c)) tetrads. This can be most
readily achieved by head-to-head dimerization to form a quadruplex as shown
schematically in Figure 2(a) and Supplementary Material, Figure S2. The
exchangeable imino and amino proton chemical shifts of the d(GAGCAGGT)
quadruplex are listed in Supplementary Material, Table S1.

Non-exchangeable proton spectra 
and assignments 

The non-exchangeable base and sugar protons have been assigned following
analysis of two-dimensional data sets on unlabeled and uniformly
13C,15N-labeled d(GAGCAGGT) sequence. An expanded 250 ms mixing time NOESY
contour plot of d(GAGCAGGT) quadruplex in 1 M NaCl, 2H2O buffer at 10 °C is
plotted in Figure 4(a), along with a tracing of distance connectivities
between base protons and their own and 5'-flanking sugar H1' protons from G1
to T8 in the sequence. The G3 residue adopts a syn alignment based on the
strong H8 to its own H1' NOE in a 50 ms mixing NOESY stacked plot
(Figure 4(b)). The remaining sugar proton chemical shifts were obtained from
an analysis of other regions of the through space NOESY contour plot and
through bond COSY and TOCSY correlations of the assigned sugar H1' protons
with the remaining sugar protons within individual rings. The
non-exchangeable base and sugar proton chemical shifts of the d(GAGCAGGT)
quadruplex are listed in Supplementary Material, Table S2. The H8 proton of
A2 (8.71 ppm) is downfield-shifted while the H6 (6.78 ppm) and H5 (5.15 ppm)
protons of C4 are somewhat upfield-shifted.

We can also correlate exchangeable imino protons with their own H8 protons
within individual guanine bases, via through bond correlations to their C5
ring carbon atoms. Such a correlation experiment on the sample of uniformly
13C,15N-labeled d(GAGCAGGT) sequence is shown in Figure 4(c).

Hydrogen bond alignments 

We have verified the formation of NOE-based G6•C4•G6•C4 (Figure 2(b)),
G3•G7•G3•G7 (Figure 2(c)) and A2•T8•A2•T8 (Figure 2(d)) tetrad alignments,
by identifying through bond coupling connectivities across N-H···N hydrogen
bonds within the folded architecture of the uniformly 13C,15N-labeled
d(GAGCAGGT) quadruplex in 1 M NaCl, H2O buffer at 0 °C. The amino protons of
A2, G3 and C4 could be correlated with their directly attached nitrogen
atoms using this labeled sample as shown in Supplementary Material,
Figure S6.

We observe a N-H···N hydrogen bond connectivity between the N1H donor of G6
and the N3 acceptor of C4 (peak 4, Figure 5(a)) in a HNN-COSY experiment24
recorded on the d(GAGCAGGT) quadruplex. This connectivity provides direct
support for a Watson-Crick G6•C4 alignment (observed couplings as shaded
region in Figure 2(b)). In addition, we observe HNN-COSY coupling
connectivities between the NH2 protons of C4 and the N7 of G6 (peaks 1 and
1', Figure 5(b)) and four bond H(CN)N(H) coupling connectivities24 between
the N4 of C4 and the H8 of G6 (peak 3, Figure 5(c)). These observations
verify the direct alignment of opposing Watson-Crick G6•C4 pairs through
their major groove edges (observed coupling as shaded region in
Figure 2(b)), to form the G6•C4•G6•C4 tetrad outlined in Figure 2(b).

The observed H(CN)N(H) coupling connectivities24 between the H8 proton of G7
and the N2 of G3 (peak 1, Figure 5(c)) and between the H8 proton of G3 and
the N2 of G7 (peak 2, Figure 5(c)), verify the proposed hydrogen bonding
patterns (shaded regions in Figure 2(c)) around the G3•G7•G3•G7 tetrad.

We observe a N-H···N hydrogen bond connectivity between the N3H donor of T8
and the N1 acceptor of A2 (peak 2, Figure 5(a)) in a HNN-COSY experiment.
This connectivity provides direct support for a Watson-Crick A2•T8
alignment. In addition, we observe HNN-COSY coupling connectivities between
the NH2 protons of A2 and the N7 of A2 (peaks 2 and 2', Figure 5(b)). This
connectivity could be either within an adenine or between adenine bases
positioned opposite each other and hydrogen-bonded through their major
groove edges, as shown for the A2•T8•A2•T8 tetrad alignment in Figure 2(d).
We do not observe the corresponding coupling connectivities between the NH2
and N7 of A5, and hence we favor the latter explanation over the former.
These observations verify the slipped alignment of opposing Watson-Crick
A2•T8 pairs through their major groove edges, to form the A2•T8•A2•T8 tetrad
outlined in Figure 2(d).

Intra-strand versus inter-strand NOE restraints

The 2-fold symmetry in the dimeric d(GAGCAGGT) quadruplex fold makes it
critical to unambiguously differentiate between intra-strand and
inter-strand contributions for key NOEs that define the folding topology in
solution. We prepared a sample containing an equimolar mixture of unlabeled
and uniformly 13C,15N-labeled d(GAGCAGGT) sequences and recorded
15N-edited, 13C,15N-purged NOESY (100 ms mixing time) spectra in 1 M
NaCl-containing H2O buffer solution at 0 °C, to identify inter-strand NOEs
and differentiate them from their intra-strand counterparts, for the 50%
component in the mixture where the quadruplex contains one unlabeled and one
uniformly labeled strand. We observe inter-strand NOEs between the imino
proton of T8 and the amino (peaks 1 and 1', Figure 6(a); peaks 1 and 1',
Figure 6(b)) and H2 (peak 2, Figure 6(b)) protons of A2, establishing
formation of Watson-Crick A2•T8 pairs between hairpin subunits in the
dimeric d(GAGCAGGT) quadruplex. We also observe inter-strand NOEs between
the imino proton of G6 and the amino protons of C4 (peaks 2 and 2',
Figure 6(a); peaks 3 and 3', Figure 6(b)), establishing formation of
Watson-Crick G6•C4 pairs between hairpin subunits in the dimeric
d(GAGCAGGT) quadruplex.

Figure 5. Identification of N-H···N hydrogen bond alignments in uniformly
13C,15N-labeled d(GAGCAGGT) (5.? mM in strands) in 1 M NaCl, 2 mM
phosphate, H2O (pH 6.6) at 0 °C. Expanded HNN-COSY contour plots correlating
two-bond coupling connectivities between donor and acceptor nitrogen atoms
within N-H···N hydrogen-bond alignments across base-pairs. (a) Two-bond
coupling connectivities between T8(N3) imino donor and A2(N1) acceptor
nitrogen atoms (peak 2) across the A2(N1)•T8(N3 donor) Watson-Crick
base-pair and between G6(N1) imino donor and C4(N3) acceptor nitrogen atoms
(peak 4) across the C4(N3)•G6(N1 donor) Watson-Crick base-pair. (b) Two-bond
coupling connectivities between C4(N4) amino donor and G6(N7) acceptor
nitrogen atoms (peaks 1,1') across C4(N4 amino)•G6(N7) mismatch pair and
between A2(N6) amino donor and A2(N7) acceptor nitrogen atoms (peaks 2,2')
across A2(N6 amino)•A2(N7) mismatch pair. (c) H(CN)N(H) spectrum showing
inter-nucleotide H8-N2 and H8-N4 cross-peaks. The spectrum consisted of
608 (t2) × 60 (t1) complex points, with 160 transients per FID. Spectral
widths of 2000 Hz (t1max: 30 ms) and 8000 Hz (t2max: 76 ms) were used, with
a relaxation delay of two seconds, resulting in a total acquisition of 12
hours. Coupling connectivities are observed between the H8 of G7 and the N2
of G3 (peak 1), between the H8 of G3 and the N2 of G7 (peak 2), as well as
between the H8 of G6 and N4 of C4 (peak 3). A coupling connectivity was not
detectable between the H8 of A2 and the N6 of A2.

We also observe inter-strand NOEs between the amino protons of A2 and the H8
proton of A2 (peaks 4 and 4', Figure 6(a)), putting restraints on the
relative alignments of the A•T base-pairs around the A•T•A•T tetrad.
Inter-strand NOEs are also observed between the imino proton of G7 and the
H2 proton of A2 (peak 4, Figure 6(b)), putting restraints on the relative
stacking of adjacent G3•G7•G3•G7 and A2•T8•A2•T8 tetrads in the quadruplex.

Distance restraints and molecular 
dynamics calculations 

Distance restraints associated with exchangeable protons (total of 115) were
qualitatively deduced from NOESY experiments in H2O at two mixing times,
while those associated with their non-exchangeable proton counterparts
(total of 178) were quantified from NOE buildup curves in 2H2O at four
mixing times, as outlined in Materials and Methods. The observation of a
single set of narrow resonances for the d(GAGCAGGT) sequence at temperatures
down to 0 °C was consistent with formation of a dimeric quadruplex,
containing two strands related by a 2-fold symmetry axis. Therefore,
non-crystallographic symmetry restraints were used during the computations.
All distance restraints were classified as ambiguous during the
distance-restrained molecular dynamics computations. Experimentally defined
hydrogen bonding alignments from NOE patterns on unlabeled sample, N-H···N
scalar couplings on isotopically labeled sample, and intermolecular NOEs on
an equimolar mixture of labeled and unlabeled sample, were used to restrain
the G6•C4•G6•C4 (Figure 2(b)), G3•G7•G3•G7 (Figure 2(c)) and A2•T8•A2•T8
(Figure 2(d)) tetrads, with the folding models retaining these hydrogen
bonding alignments during the computations.

The solution structure of the dimeric d(GAGCAGGT) quadruplex in 1 M NaCl was
solved by molecular dynamics computations guided by hydrogen bonding and NOE
distance restraints. Sixty starting structures were generated for the
d(GAGCAGGT) 8-mer segment as sets of pairs of randomized chains separated by
space intervals of 50 Å. The protocol outlined in Materials and Methods
involved initial torsion space dynamics at 20,000 K followed by Cartesian
space dynamics at 300 K. A subset of ten distance-refined structures of the
d(GAGCAGGT) quadruplex were identified based on a combination of low NOE
energies and fewest NOE violations.

Intensity restraints and NOE back calculations 

The subset of ten converged distance-refined structures were next refined
against the non-exchangeable proton NOE intensities associated with NOESY
spectra recorded at four mixing times. These computations utilized a
molecular dynamics with back calculation protocol outlined in Materials and
Methods. The NOE violations, deviations from covalent geometry and pairwise
r.m.s.d. values for the ten lowest energy intensity-refined structures of
the d(GAGCAGGT) quadruplex (less the poorly defined G1 residues) are listed
in Table 1.

Structural features

A stereo view of the ten superpositioned lowest energy
intensity refined structures of the d(GAGCAGGT) quadruplex
(less the poorly defined G1 residues) is shown in
Figure 7(a). The sugar-phosphate backbones of individual
symmetry-related hairpins are colored in orange and green,
with the sequentially stacked loop A5 residue, G6·C4·G6·C4
tetrad, G3·G7·G3·G7 tetrad and A2·T8·A2·T8 tetrad colored in
white, magenta, yellow and cyan, respectively. Ribbon and
surface views of one representative refined structure of the
d(GAGCAGGT) quadruplex using the GRASP program are plotted in
Figure 7(b) and (c), respectively.

A stick representation of one symmetric strand of the
d(GAGCAGGT) quadruplex (less the poorly defined G1 residues)
is shown in Figure 8(a). The pairing alignments of the
G6·C4·G6·C4 tetrad and A2·T8·A2·T8 tetrad in this
representative structure are shown in Figure 8(b) and (c),
respectively.

The stacking geometry between the A5 bases (in white) and the
G6·C4·G6·C4 tetrad (in magenta) is shown in Figure 9(a),
while that between the G6·C4·G6·C4 tetrad (in magenta) and
the G3·G7·G3·G7 tetrad (in yellow) is shown in Figure 9(b).
The overlap geometry between the G3·G7·G3·G7 tetrad (in
yellow) and the A2·T8·A2·T8 tetrad (in cyan) is shown in
Figure 9(c).

Discussion 

The NMR-based structural characterization of the d(GAGCAGGT)
quadruplex was considerably aided by the narrow and
well-resolved resonances of the one and two-dimensional
proton NMR spectra, and the availability of uniformly
¹³C,¹⁵N-labeled sequence, which aided greatly in the
resonance assignment and identification of hydrogen bonding
alignments. The computations were also aided by our ability
to distinguish between inter-strand and intra-strand NOEs
(and in turn, Watson-Crick hydrogen bonding alignments) using
equimolar mixtures of labeled and unlabeled samples of the
quadruplex.

Table 1. NMR restraints and structural statistics for the intensity refined
structures of the dimeric d(GAGCAGGT) quadruplex

A. NMR restraints
                                                     Non-exchangeable   Exchangeable
Distance restraintsᵃ
  Intra-residue distance restraints                         86
  Sequential (i, i + 1) distance restraints                 76               45
  Long range ≥(i, i + 2) distance restraints                16               51
Other restraints
  Hydrogen bonding restraintsᵇ                              48
  Torsion angle restraintsᶜ                                 14
Intensity restraints
  Non-exchangeable protons (each of 4 mixing times)        178

B. Structural statistics of complex following intensity refinement
NOE violations
  Number >0.2 Å                                        6.1 ± 0.7
  Maximum violations (Å)                               0.27 ± 0.03
  r.m.s.d. of violations (Å)                           0.056 ± 0.02
  NMR R factor (R1/6)                                  0.074 ± 0.003
Deviations from ideal covalent geometry
  Bond lengths (Å)                                     0.011 ± 0.001
  Bond angles (°)                                      3.70 ± 0.09
  Impropers (°)                                        0.40 ± 0.03
Pairwise all heavy atom r.m.s.d. values (10 refined structures)
  All heavy atoms excluding G1 (Å)                     0.78 ± 0.17

ᵃ All distance restraints were set as ambiguous between intra- and inter-residue contributions.
ᵇ These hydrogen bonding restraints are based on experimental NOE and JNN coupling data.
ᶜ Residues A2, C4, A5, G6, G7 and T8 were restrained to χ values in the 210(±40)° range, characteristic of anti glycosidic torsion
angles, while residues G3 were restrained to χ values in the 65(±40)° range, characteristic of syn glycosidic torsion angles, identified
experimentally.

Quadruplex architecture

The experimental data on the d(GAGCAGGT) sequence are
consistent with formation of a dimeric quadruplex in 1 M NaCl
solution. Individual strands of d(GAGCAGGT) form hairpins,
with a single adenine, A5, involved in chain reversal.
Dimerization involves head-to-head orientation of hairpins
(Figures 2(a) and 7; Supplementary Material, Figure S1), with
individual strands running antiparallel relative to their
neighbors around the quadruplex (Figure 2(a)).

G·G·G·G tetrad

We detect a G3(syn)·G7(anti)·G3(syn)·G7(anti) alignment
around the G·G·G·G tetrad in the dimeric d(GAGCAGGT)
quadruplex. Such an alignment about the G·G·G·G tetrad is
consistent with antiparallel alignment of adjacent strands
around the quadruplex and has been reported for the solution
structures of the thrombin-binding DNA aptamer
quadruplex,²⁷,²⁸ as well as the d(GCGGT₃GCGG) Fragile X
syndrome sequence-containing quadruplex⁷ and the
d(GGGCT₄GGGC) adeno-associated virus sequence-containing
quadruplex.¹⁰

G·C·G·C tetrad

The G6·C4·G6·C4 tetrad (Figures 2(b) and 8(b)) is defined by
all anti glycosidic torsion angles, consistent with
antiparallel arrangement of adjacent strands around the
dimeric d(GAGCAGGT) quadruplex. This type of direct G·C·G·C
tetrad alignment (Figure 1(a)) has been observed previously
for the solution structures of the d(GCGGT₃GCGG) Fragile X
syndrome sequence-containing quadruplex⁷ and the
d(GGGCT₄GGGC) adeno-associated virus sequence-containing
quadruplex.¹⁰ There is, however, a critical distinction in
that Watson-Crick G6·C4 pairs are formed between rather than
within hairpin subunits (Figure 8(b)) in the head-to-head
dimeric d(GAGCAGGT) quadruplex reported here. This contrasts
with Watson-Crick G·C pairs within hairpin subunits in the
head-to-tail dimeric d(GCGGT₃GCGG)⁷ and d(GGGCT₄GGGC)¹⁰
quadruplexes reported previously.

The amino protons of C4 in the d(GAGCAGGT) quadruplex
resonate at 8.49 ppm and 9.23 ppm. The chemical shift of
8.49 ppm is characteristic of a cytosine amino proton
hydrogen-bonded to a carbonyl oxygen atom, such as would
occur with a guanine residue in a Watson-Crick G·C pair. The
chemical shift of 9.23 ppm is consistent with the other
cytosine amino proton also being hydrogen bonded, and its
much larger downfield shift may be reflective of its
acceptors being both a ring nitrogen and carbonyl oxygen,
such as would occur in a direct G·C·G·C tetrad (Figure 1(a)).



A·T·A·T tetrad

The slipped A2·T8·A2·T8 tetrad (Figures 2(d) and 8(c))
contains all anti glycosidic torsion angles, with hydrogen
bonding along the major groove edges of opposing adenine
bases. This is the first report of any type (direct or
slipped) of an A·T·A·T tetrad alignment, which involves
pairing along the major groove edges. This A·T·A·T pairing
alignment brings the H8 proton of one adenine in close
proximity to the NH₂ proton of its adenine partner
(Figure 1(d)), consistent with the observed intermolecular
NOE between this pair of protons in mixed labeling
experiments (peaks 4 and 4', Figure 6(a)). Equally important
is our demonstration that Watson-Crick A2·T8 pairs are formed
between rather than within hairpin subunits (Figure 8(c)) in
the head-to-head dimeric d(GAGCAGGT) quadruplex.

Figure 6. Identification of inter-strand NOEs. An expanded ¹⁵N-edited (ω₁), ¹³C,¹⁵N-purged (ω₂) NOESY (100 ms
mixing time) contour plot of a 1:1 mixture of unlabeled and uniformly ¹³C,¹⁵N-labeled d(GAGCAGGT) (3.3 mM in
strands) in 1 M NaCl, 2 mM phosphate, H₂O (pH 6.6) at 0 °C. (a) Carrier is on amino-¹⁵N. The relevant cross-peaks
identifying inter-molecular NOEs are assigned as follows: 1,1', T8(NH3)-A2(NH₂-6); 2,2', G6(NH1)-C4(NH₂-4); 3,3',
G7(NH1)-A2(NH₂-6); and 4,4', A2(H8)-A2(NH₂-6). (b) Carrier is on imino-¹⁵N. The relevant cross-peaks identifying
inter-molecular NOEs are assigned as follows: 1,1', T8(NH3)-A2(NH₂-6); 2, T8(NH3)-A2(H2); 3,3', G6(NH1)-C4(NH₂-4);
4, G7(NH1)-A2(H2).

The amino protons of A2 in the d(GAGCAGGT) quadruplex
resonate at 7.80 ppm and 9.16 ppm. The chemical shift of
7.80 ppm is characteristic of an adenine amino proton
hydrogen-bonded to a carbonyl oxygen atom, such as would
occur with a thymine residue in a Watson-Crick A·T pair. The
chemical shift of 9.16 ppm is consistent with the other
adenine amino proton also being hydrogen bonded, and its much
larger downfield shift is reflective of its acceptor being a
ring nitrogen, such as would occur in a slipped A·T·A·T
tetrad (Figure 1(d)).



Single base chain reversal 

Chain reversal involves a single adenine, A5, within
individual hairpin subunits (Figure 8(a)). The A5 base is
bracketed by a closing G6·C4 mismatch involving pairing
between the amino group of C4 and the acceptor atoms along
the major groove edge of G6 (Figure 2(b)). This concept of a
single base hairpin loop bracketed by a mismatch pair was
first reported for single base loops closed by sheared G·A
mismatches in antiparallel duplexes.²⁹,³⁰ There is no pairing
between A5 residues across from each other (Figure 9(a)) in
the head-to-head hairpin dimeric d(GAGCAGGT) quadruplex.
Rather, the A5 residues are stacked over the adjacent C4
residues (Figure 9(a)).

Figure 7. (a) Superpositioned stereo view of ten intensity refined structures of the dimeric d(GAGCAGGT)
quadruplex. The backbones of the individual strands are in orange and green with phosphate oxygen atoms removed
for clarity. The A5 residue is in white, the G6·C4·G6·C4 tetrad in magenta, G3·G7·G3·G7 tetrad in yellow
and A2·T8·A2·T8 tetrad in cyan. (b) A GRASP ribbon view of a representative intensity refined structure of the
d(GAGCAGGT) quadruplex. The backbones of individual strands are colored in blue and red. Guanine, adenine and
thymine bases are colored yellow, red and blue, respectively. (c) A GRASP surface view of a representative intensity
refined structure of the d(GAGCAGGT) quadruplex.

We have checked for non-standard backbone torsion and sugar
pucker pseudo-rotation (P) angles within the single residue
edgewise turn spanning the C4-A5-G6 segment amongst the
refined structures of the dimeric d(GAGCAGGT) quadruplex. For
the C4-A5 step, the P value for C4 of 69.8(±5.7)° places it
in the C4'-exo range, the γ value of 149.9(±40.1)° places it
in the trans range (in contrast to standard gauche+ value of
64°), and the χ value for A5 of 170.1(±4.6)° places it in the
low anti range.
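
The angle classifications quoted above can be reproduced with a short sketch. It assumes the conventional 36° envelope sectors of the pseudorotation wheel (C3'-endo centred at 18°, C4'-exo at 54°, and so on), 120° windows for the γ rotamers, and the anti window of 210(±40)° that this study uses for its glycosidic torsion restraints. The function names and sector boundaries are illustrative assumptions, not definitions taken from the paper.

```python
# Conventional envelope puckers, one per 36-degree sector starting at P = 0.
# (Assumed convention; the paper does not list its sector boundaries.)
PUCKER_SECTORS = [
    "C3'-endo", "C4'-exo", "O4'-endo", "C1'-exo", "C2'-endo",
    "C3'-exo", "C4'-endo", "O4'-exo", "C1'-endo", "C2'-exo",
]


def pucker_sector(p_deg):
    """Name the 36-degree pseudorotation sector that contains P."""
    return PUCKER_SECTORS[int((p_deg % 360.0) // 36.0)]


def gamma_rotamer(gamma_deg):
    """Classify a backbone gamma torsion into a staggered rotamer."""
    g = gamma_deg % 360.0
    if g < 120.0:
        return "gauche+"
    if g < 240.0:
        return "trans"
    return "gauche-"


def in_window(angle_deg, center_deg, half_width_deg):
    """True if angle lies within center +/- half_width on the circle."""
    diff = (angle_deg - center_deg + 180.0) % 360.0 - 180.0
    return abs(diff) <= half_width_deg


print(pucker_sector(69.8))           # C4 sugar pucker
print(gamma_rotamer(149.9))          # gamma at the C4-A5 step
print(in_window(170.1, 210.0, 40.0)) # A5 chi against the anti window
```

With these conventions the A5 χ value of 170.1° sits just inside the lower edge of the 210(±40)° anti window, which is in line with its description as "low anti".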



Base stacking 

There is excellent pyrimidine/purine and purine/purine
stacking between the bases of the direct G6·C4·G6·C4 tetrad
(in magenta) and the G3·G7·G3·G7 tetrad (in yellow) in the
folded topology of the d(GAGCAGGT) quadruplex (Figure 9(b)).
We observe partial stacking between the bases of the
G3·G7·G3·G7 tetrad (in yellow) and the slipped A2·T8·A2·T8
tetrad (in cyan) in the folded topology of the d(GAGCAGGT)
quadruplex (Figure 9(c)). Overall, there is significant
stacking of bases along the length of the dimeric d(GAGCAGGT)
quadruplex.

The stacking pattern between the G3·G7·G3·G7 tetrad (in
yellow) and the slipped A2·T8·A2·T8 tetrad (in cyan)
(Figure 9(c)) positions the imino proton of G7 of one strand
in close proximity to the NH₂ and H2 protons of A2 of the
partner strand, consistent with the observed inter-strand
NOEs (peaks 3 and 3', Figure 6(a); peak 4, Figure 6(b)) in
mixed labeling experiments.



Inter-strand NOEs 

We have observed a set of weaker inter-strand NOEs, in
addition to their stronger counterparts shown in Figure 6(a)
and (b), that merit further discussion. The studies presented
in Figure 6 on an equimolar sample of unlabeled and uniformly
¹³C,¹⁵N-labeled d(GAGCAGGT) quadruplex in 1 M NaCl-containing
H₂O buffer were recorded at 0 °C. We have also collected a
¹⁵N-edited (ω₁), ¹³C,¹⁵N-purged (ω₂) NOESY²⁶ (100 ms mixing
time) spectrum, with the carrier on the imino-¹⁵N, recorded
at 10 °C. This spectrum had improved signal-to-noise,
permitting data presentation at a lower contour level (as
shown in Supplementary Material, Figure S7). We observe a set
of weak inter-strand NOE cross-peaks labeled 5 to 9 in
Supplementary Material, Figure S7, in addition to stronger
peaks 1 to 4 that were also seen in Figure 6(b).

Peaks 5 and 5', assigned to inter-strand NOEs between the
imino proton of G7 and the amino protons of C4, are
consistent with the stacking pattern shown in Figure 9(b).
Peaks 6 and 6', assigned to inter-strand NOEs between the
imino proton of G7 and the amino protons of A2, are
consistent with the stacking pattern shown in Figure 9(c).
Peak 9, assigned to an inter-strand NOE between the imino of
G6 and the H2 of A5, is consistent with the stacking pattern
shown in Figure 9(a).
Figure 8. (a) A stick representation of a symmetric half
(corresponding to one of the two strands) of a representative
intensity refined structure of the d(GAGCAGGT) quadruplex.
The color code is the same as outlined in the legend to
Figure 7. (b) Pairing alignment of the G6·C4·G6·C4 tetrad.
(c) Pairing alignment of the A2·T8·A2·T8 tetrad.



By contrast, peak 7, assigned to an inter-strand NOE between
the imino of G3 and the H8 of A2, cannot be explained by the
overlap pattern in Figure 9(c). Peak 8, representing an
inter-strand NOE assigned to the imino of G3 and the H8
proton of G6/G7, cannot be explained by the overlap pattern
shown in Figure 9(b). In both cases, while the inter-strand
distances between proton pairs are long, their intra-strand
counterparts are in close proximity. Thus, it could be argued
that perhaps peaks 7 and 8 reflect relatively strong
intra-strand NOEs that are not completely removed by isotope
filtering. For this explanation to be valid, peaks 7 and 8
should have been doublets, because ¹³C decoupling was not
applied during acquisition. Currently, we are somewhat at a
loss to account for this discrepancy involving weak peaks 7
and 8 (Supplementary Material, Figure S7), whose intensity
did vary in the weak range when the experiments were checked
for reproducibility.

Figure 9. Base stacking overlap patterns in a representative
intensity refined structure of the d(GAGCAGGT) quadruplex.
(a) Stacking of A5 (in white) on the G6·C4·G6·C4 tetrad (in
magenta). (b) Stacking of the G6·C4·G6·C4 tetrad (in magenta)
on the G3·G7·G3·G7 tetrad (in yellow). (c) Stacking of the
G3·G7·G3·G7 tetrad (in yellow) on the A2·T8·A2·T8 tetrad (in
cyan).

Salt dependence 

The broad resonances for the d(GAGCAGGT) sequence in low NaCl
solution (Supplementary Material, Figure S3(a)) are
indicative of aggregate formation. What was unexpected was
the transition to narrow resonances in 1 M NaCl solution
(Supplementary Material, Figure S3(b)), resulting in the
formation of a dimeric d(GAGCAGGT) quadruplex amenable to
structural characterization. Both the direct G·C·G·C
(Figure 1(a)) and slipped A·T·A·T (Figure 1(d)) tetrads have
inwardly pointing amino groups, an alignment not conducive to
coordinating monovalent cations positioned either between or
within tetrad planes. Therefore, it is conceivable that 1 M
NaCl has minimal effect on the integrity of the dimeric
d(GAGCAGGT) quadruplex, but for some unknown reason may
destabilize the aggregated species that predominates in low
salt solution.

Comparison with mixed tetrads involving
minor groove alignment of G·C and A·T pairs

Our structural studies on major groove-aligned direct G·C·G·C
and slipped A·T·A·T tetrads in the head-to-head dimeric
d(GAGCAGGT) quadruplex (this study) can be compared with
reported structural studies on quadruplexes involving minor
groove-aligned direct G·C·G·C (Supplementary Material,
Figure S1(a)) and slipped cation-coordinated A·T·A·T
(Supplementary Material, Figure S1(b)) tetrads formed by
dimerization of linear and cyclic octameric
sequences.*'* 1 ' 14 Interestingly, the minor groove-aligned
direct G·C·G·C (Supplementary Material, Figure S1(a)) and
slipped cation-coordinated A·T·A·T (Supplementary Material,
Figure S1(b)) tetrads also involve Watson-Crick G·C and A·T
pairing between rather than within head-to-head hairpin
dimers of the quadruplex.

It should be noted that while the major groove-aligned direct
G·C·G·C (Figure 1(a)) and slipped A·T·A·T (Figure 1(d))
tetrads adopt planar geometries in the solution structure of
the dimeric d(GAGCAGGT) quadruplex (this study), the minor
groove-aligned direct G·C·G·C (Supplementary Material,
Figure S2(a)) and slipped cation-coordinated A·T·A·T
(Supplementary Material, Figure S2(b)) tetrads adopt
distinctly non-planar geometries within their quadruplexes,
both in the crystalline* 14 and solution* states.

Biological relevance 

We have reported the first experimental demonstration for
formation of a major groove-aligned slipped A·T·A·T tetrad
(Figure 1(d)) within a DNA quadruplex architecture. This
result, together with our previous demonstration of major
groove-aligned direct G·C·G·C tetrad (Figure 1(a))⁷ formation
within a quadruplex architecture, expands on the code for
alignment of duplex segments that can potentially participate
in strand exchange during homologous recombination. The
glycosidic bonds are directed outwards in major
groove-aligned tetrads (Figure 1), while they are directed
inwards in minor groove-aligned tetrads (Supplementary
Material, Figure S1). There is steric crowding between the
inwardly pointing sugars in the minor groove-aligned tetrads,
resulting in base-sugar and sugar-sugar interactions which
severely buckle the planes of the tetrads.* 0 ' 13 By
contrast, there is no such crowding between the outwardly
pointing sugars in the major groove-aligned mixed tetrads,
resulting in a planar arrangement of bases within the tetrad.
Several challenges remain despite our identification here of
a major groove-aligned slipped A·T·A·T tetrad (Figure 1(d))
at one end of a quadruplex. Can one identify formation of a
major groove-aligned direct A·T·A·T tetrad (Figure 1(c)), and
can either the direct or slipped A·T·A·T tetrads be
positioned in the interior of the quadruplex?

McGavin³¹ was the first to recognize the potential for
self-pairing of Watson-Crick G·C and A·T pairs to form dyad
axes related major groove-aligned G·C·G·C and A·T·A·T
tetrads. Such tetrad-based quadruplexes could play a role in
strand exchange between two homologous duplexes during
genetic recombination. Wilson 112 next proposed a
quadruplex-based model for formation of reciprocal
heteroduplexes from their homologous duplex counterparts. In
this model, homologous duplexes initially associate through
G·C·G·C and A·T·A·T tetrad formation along their minor groove
edges, placing complementary strands opposite each other.
Strand exchange can then occur following a 90° rotation of
each base, such that the base-pairs now face each other
through G·C·G·C and A·T·A·T tetrad formation along their
major groove edges. Experimental studies ,va have provided
evidence for quadruplex formation by poly(CA)n·poly(TG)n
repeats, while computational studies** have established that
such strand exchange quadruplexes, as well as
quadruplex-duplex junctions, are stereochemically robust and
form stable entities. Thus, a future challenge would be to
design a DNA quadruplex consisting solely of G·C·G·C and
A·T·A·T tetrads and attempt to trap structures where the
alignment is through the minor groove on the one hand, and
through the major groove on the other.



Materials and Methods 

Preparation of unlabeled and uniformly ¹³C,¹⁵N-labeled DNA

The unlabeled d(GAGCAGGT) sequence was synthesized on a
10 µmol scale on an Applied Biosystems 392 DNA synthesizer
using solid phase β-cyanoethylphosphoramidite chemistry and
was subsequently purified by high pressure liquid
chromatography (HPLC).

A modified version of the Zimmer & Crothers* procedure as
described ,w * was used for the enzymatic synthesis of
uniformly ¹³C,¹⁵N-labeled d(GAGCAGGT) sequence. The in-house
prepared uniformly ¹³C,¹⁵N-labeled dNTPs 19 ^ were used as
building blocks in the in vitro polymerization reaction
catalyzed by Moloney murine leukemia virus (MMLV) reverse
transcriptase (Gibco-BRL). The uniformly ¹³C,¹⁵N-labeled
d(GAGCAGGT) 8-mer was separated from the unlabeled 24-mer
template using 22% (w/v) denaturing polyacrylamide
electrophoresis. The DNA 8-mer bands were eluted from the gel
by the "crush and soak" procedure and purified as described
above for the non-labeled samples.

NMR data collection and processing 

NMR data on the d(GAGCAGGT) 8-mer in H₂O and ²H₂O buffer (1 M
NaCl, 2 mM phosphate (pH 6.6)) were collected on a Varian
600 MHz Unity INOVA NMR spectrometer. Proton assignments are
based on homonuclear NOESY, correlation spectroscopy (COSY),
TOCSY and HCCNH-TOCSY experiments. Data sets were processed
and analyzed using the FELIX program (Molecular Simulations).

Two-bond ²JNN scalar couplings between imino and amino donors
and nitrogen acceptors in uniformly ¹³C,¹⁵N-labeled
d(GAGCAGGT) in 1 M NaCl were monitored in HNN-COSY"- 22 and
H(CN)N(H) 24 ^ contour plots using pulse sequences described
in the literature.

Distance restraints 

The distances between non-exchangeable protons were estimated
from the buildup curves of cross-peak intensities in NOESY
spectra at four different mixing times (50, 150, 200 and
250 ms) in ²H₂O and given bounds of ±30% with distances
referenced relative to the cytosine H6-H5 distance of 2.47 Å.
Exchangeable proton restraints are based on NOESY data sets
at two mixing times (60 and 200 ms) in H₂O. Cross-peaks
involving exchangeable protons were classified as strong
(medium to strong intensity at 60 ms), medium (weak intensity
at 60 ms) and weak (observed only at a mixing time of 200 ms)
and proton pairs were then restrained, respectively, to
distances of 3.0(±0.9) Å, 4.0(±1.2) Å and 6.0(±1.8) Å. Since
the experimental NMR data are consistent with a dimeric
quadruplex motif, non-crystallographic symmetry restraints
were imposed on all heavy atoms.
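
As a rough illustration of how such restraints can be tabulated, the sketch below converts a buildup rate into a distance by scaling against the cytosine H6-H5 reference, attaches ±30% bounds, and maps the three exchangeable-proton classes to their target distances. It assumes the isolated spin-pair approximation (initial buildup rate proportional to r⁻⁶), which the text does not state explicitly; all function names are hypothetical, and the 1.8 Å tolerance for the weak class is taken as 30% of 6.0 Å by analogy with the other two classes.

```python
REF_DISTANCE = 2.47  # Å, cytosine H6-H5 reference distance quoted in the text


def distance_from_buildup(rate, ref_rate, ref_distance=REF_DISTANCE):
    """Scale the reference distance by the sixth root of the rate ratio.

    Assumes the initial NOE buildup rate is proportional to r**-6
    (isolated spin-pair approximation; an assumption of this sketch).
    """
    if rate <= 0 or ref_rate <= 0:
        raise ValueError("buildup rates must be positive")
    return ref_distance * (ref_rate / rate) ** (1.0 / 6.0)


def bounds(distance, fraction=0.30):
    """Lower and upper bounds at +/- fraction of the estimated distance."""
    return distance * (1.0 - fraction), distance * (1.0 + fraction)


# Exchangeable-proton classes: (target distance, tolerance) in Å.
# The 1.8 Å value for the weak class is an assumption (30% of 6.0 Å).
EXCHANGEABLE_CLASSES = {
    "strong": (3.0, 0.9),
    "medium": (4.0, 1.2),
    "weak": (6.0, 1.8),
}


def classify_exchangeable(intensity_at_60ms):
    """Map the 60 ms intensity label (None if absent) to a restraint class."""
    if intensity_at_60ms in ("medium", "strong"):
        return "strong"
    if intensity_at_60ms == "weak":
        return "medium"
    if intensity_at_60ms is None:  # cross-peak seen only at 200 ms
        return "weak"
    raise ValueError(f"unknown intensity label: {intensity_at_60ms!r}")


# A cross-peak building up 64 times more slowly than the reference
# corresponds to twice the reference distance.
d = distance_from_buildup(rate=1.0, ref_rate=64.0)
print(round(d, 2), [round(b, 2) for b in bounds(d)])
```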



Structure calculations

The structure of the d(GAGCAGGT) in 1 M NaCl was determined
by molecular dynamics (MD)-simulated annealing computations
driven by NOE distance and hydrogen bonding restraints using
the X-PLOR package, version 3.8.™ At the initial stage of the
refinement, torsional molecular dynamics was undertaken at
high temperature. The molecules were equilibrated at 20,000 K
(30,000 steps over 3 ps) and then cooled very slowly to
1000 K (40,000 steps over 20 ps). The potential energy
function included a repulsive force field, NOE and hydrogen
bond distance restraints, glycosidic bond (χ) dihedral angle
restraints and a non-crystallographic symmetry potential. The
force constant for NOE distance restraints was maintained at
a value of 30 kcal mol⁻¹ Å⁻², while for hydrogen bond
restraints the value was 50 kcal mol⁻¹ Å⁻². All NOE distance
restraints were considered as ambiguous and treated with the
"sum" averaging option.*'* Dihedral angle restraints
(210(±40)°, with force constant of 50 kcal mol⁻¹ rad⁻²) were
imposed on glycosidic torsion angles for the residues A2, C4,
A5, G6, G7 and T8 shown experimentally to adopt anti
conformations. Dihedral angle restraints (65(±40)°, with
force constant of 50 kcal mol⁻¹ rad⁻²) were imposed on
glycosidic torsion angles for the G3 residue shown
experimentally to adopt syn conformation. The force constant
for non-crystallographic symmetry was maintained at
30 kcal mol⁻¹ Å⁻².
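
The "sum" treatment of ambiguous restraints can be sketched as follows. In a symmetric dimer a single cross-peak may arise from the intra-strand or the inter-strand proton pair, and sum averaging combines all candidate distances into one effective distance, r_eff = (Σ rᵢ⁻⁶)^(-1/6), which is then compared with the restraint bounds. The flat-bottomed harmonic penalty used below is an illustrative assumption rather than the exact functional form used in the X-PLOR runs, and the function names are hypothetical.

```python
def sum_averaged_distance(distances):
    """Effective distance for an ambiguous restraint: (sum r**-6) ** (-1/6)."""
    if not distances or any(d <= 0 for d in distances):
        raise ValueError("need at least one positive distance")
    return sum(d ** -6 for d in distances) ** (-1.0 / 6.0)


def flat_bottom_energy(r_eff, lower, upper, k=30.0):
    """Harmonic penalty outside [lower, upper]; zero inside.

    k defaults to the 30 kcal mol^-1 A^-2 NOE force constant quoted in
    the text; the flat-bottomed form itself is an assumption.
    """
    if r_eff < lower:
        deviation = lower - r_eff
    elif r_eff > upper:
        deviation = r_eff - upper
    else:
        deviation = 0.0
    return k * deviation ** 2


# Intra-strand pair at 3.2 Å, symmetry-related inter-strand pair at 9.5 Å:
# the short distance dominates the effective distance.
r_eff = sum_averaged_distance([3.2, 9.5])
print(round(r_eff, 3), flat_bottom_energy(r_eff, 2.1, 3.9))
```

Because the sum runs over r⁻⁶, the effective distance is never longer than the shortest contributing distance, so a restraint is satisfied as soon as either the intra-strand or the inter-strand pair is close enough.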

These computations were followed by lower temperature
Cartesian space molecular dynamics guided by the hydrogen
bonding and NOE distance restraints with changes in the
potential energy function: the repulsive force field was
replaced with Lennard-Jones potentials and planarity
restraints were included for tetrad planes with low weights
of 5 kcal mol⁻¹ Å⁻² and 10 kcal mol⁻¹ Å⁻², respectively. The
structures were further cooled from 1000 K to 300 K (20,000
steps over 10 ps) and minimized until the gradient of energy
was less than 0.1 kcal mol⁻¹. It should be noted that
computations repeated without planarity restraints resulted
in the same low energy structures, but exhibited a lower
convergence rate.
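
The step counts and durations quoted for the annealing stages imply the integration time steps worked out below. The stage labels are descriptive, and the assumption of a linear temperature ramp during cooling is ours; the text gives only the end-point temperatures.

```python
# (label, steps, duration in ps, start temperature in K, end temperature in K)
STAGES = [
    ("torsion-space equilibration", 30_000, 3.0, 20_000.0, 20_000.0),
    ("torsion-space cooling", 40_000, 20.0, 20_000.0, 1_000.0),
    ("Cartesian-space cooling", 20_000, 10.0, 1_000.0, 300.0),
]


def timestep_fs(steps, duration_ps):
    """Integration time step in femtoseconds implied by steps and duration."""
    return duration_ps * 1000.0 / steps


def cooling_per_step(steps, t_start, t_end):
    """Temperature drop per step, assuming a linear ramp (an assumption)."""
    return (t_start - t_end) / steps


for label, steps, ps, t0, t1 in STAGES:
    print(f"{label}: {timestep_fs(steps, ps):.1f} fs/step, "
          f"{cooling_per_step(steps, t0, t1):.3f} K/step")
```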

The refinement protocol started from 60 different initial
structures. The initial structures were generated as sets of
two chains, each eight nucleotides long, randomized for all
dihedral angles, and separated by space intervals of 50 Å.
The convergence rate following dynamics was good for the case
where the computations were guided by hydrogen bonding
restraints associated with the topology shown in Figure 2(a):
sixteen structures out of 60 emerged with the same fold and
pairwise r.m.s.d. values less than 1.0 Å between members of
the group. Non-converged structures were separated from that
group by large gaps (in total more than 400 kcal) in all
components of the potential energy (van der Waals, NOE
violations, covalent geometry).
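
One simple way to express this selection is to rank the structures by total energy and cut at the first gap larger than a threshold. The sketch below does this with made-up energies; the function names and the toy values are hypothetical, and only the 400 kcal gap and the 16-of-60 count come from the text.

```python
def converged_group(energies, gap=400.0):
    """Return the lowest-energy structures lying below the first large gap."""
    ranked = sorted(energies)
    for i in range(1, len(ranked)):
        if ranked[i] - ranked[i - 1] > gap:
            return ranked[:i]
    return ranked  # no gap found: everything belongs to one group


def convergence_rate(n_converged, n_total):
    """Fraction of starting structures that reached the common fold."""
    return n_converged / n_total


# Toy total energies (kcal); values are illustrative only.
toy_energies = [120.0, 95.0, 110.0, 640.0, 900.0]
print(converged_group(toy_energies))
print(f"{convergence_rate(16, 60):.1%}")
```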

The ten converged distance refined structures corresponding
to the folding topology shown schematically in Figure 1(a)
were used as the starting point for subsequent X-PLOR based
energy minimization with back-calculation of the NOESY
spectra. The relaxation matrix was set up for the
non-exchangeable protons, with the exchangeable imino and
amino protons exchanged for deuterons. A total of 712
non-exchangeable intensity values from NOESY data sets at
four mixing times in ²H₂O buffer (178 non-exchangeable
intensities per mixing time) were included with force
constant of 500 kcal mol⁻¹. The planarity restraints were set
to very low values of 1.0 kcal Å⁻² for the G·G·G·G tetrad and
0.5 kcal Å⁻² for the A·T·A·T and G·C·G·C tetrads. The
distance restraints were retained with 30% bounds and the
same weights as before. During minimization, the NMR R factor
(R1/6) improved from the initial value of 12% to 7.4% while
retaining structure convergence and stereochemistry.
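
For orientation, a sixth-root R factor of the kind reported here can be computed as sketched below. The definition used, R1/6 = Σ|I_obs^(1/6) − k·I_calc^(1/6)| / Σ I_obs^(1/6), is one common form and is assumed rather than taken from the paper; the scale factor k and the function name are likewise assumptions. The intensity count simply restates the 4 × 178 = 712 values given in the text.

```python
MIXING_TIMES = 4
INTENSITIES_PER_MIXING_TIME = 178
TOTAL_INTENSITIES = MIXING_TIMES * INTENSITIES_PER_MIXING_TIME  # 712


def sixth_root_r_factor(observed, calculated, scale=1.0):
    """Sixth-root R factor between observed and back-calculated intensities.

    Assumed definition: sum|Io**(1/6) - scale * Ic**(1/6)| / sum Io**(1/6).
    Intensities must be non-negative.
    """
    if not observed or len(observed) != len(calculated):
        raise ValueError("need two non-empty lists of equal length")
    numerator = sum(
        abs(o ** (1.0 / 6.0) - scale * c ** (1.0 / 6.0))
        for o, c in zip(observed, calculated)
    )
    denominator = sum(o ** (1.0 / 6.0) for o in observed)
    return numerator / denominator


# Toy intensities (arbitrary units), for illustration only.
print(TOTAL_INTENSITIES)
print(round(sixth_root_r_factor([1.0, 64.0], [64.0, 64.0]), 3))
```

Taking the sixth root puts intensities on a scale roughly proportional to inverse distance, so weak long-range cross-peaks contribute to the agreement measure on a footing comparable to strong short-range ones.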



Coordinates deposition 

Coordinates (accession number 1JVC) of the dimeric
d(GAGCAGGT) quadruplex have been deposited in the RCSB
Protein Data Bank.